Provenance Tracking in an Earth Science Data Processing System
نویسندگان
چکیده
NASA and other organizations involved with climate research have captured huge archives of earth observations. The sensors, spacecraft, and science algorithms for transforming and analyzing the data and the processing frameworks are evolving over time. Science Data Processing Systems (SDPSes) should capture, archive, and distribute provenance information of all externally received data and algorithms, as well as describing all internal processes used for data transformation. This will make the data sets produced by the systems easier to understand, enable independent scientific reproducability, and ultimately, increase the credibility of the scientific research that makes use of those
منابع مشابه
Tracking provenance of earth science data
Tremendous volumes of data have been captured, archived and analyzed. Sensors, algorithms and processing systems for transforming and analyzing the data are evolving over time. Web Portals and Services can create transient data sets on-demand. Data are transferred from organization to organization with additional transformations at every stage. Provenance in this context refers to the source of...
متن کاملDistinguishing Provenance Equivalence of Earth Science Data
Reproducibility of scientific research relies on accurate and precise citation of data and the provenance of that data. Earth science data are often the result of applying complex data transformation and analysis workflows to vast quantities of data. Provenance information of data processing is used for a variety of purposes, including understanding the process and auditing as well as reproduci...
متن کاملES3: A Demonstration of Transparent Provenance for Scientific Computation
The Earth System Science Server (ES3) is a software environment for data-intensive Earth science, with unique capabilities for automatically and transparently capturing and managing the provenance of arbitrary computations. Transparent acquisition avoids the scientist having to express their computations in specific languages or schemas for provenance to be available. ES3 models provenance as r...
متن کاملComputational provenance in hydrologic science: a snow mapping example.
Computational provenance--a record of the antecedents and processing history of digital information--is key to properly documenting computer-based scientific research. To support investigations in hydrologic science, we produce the daily fractional snow-covered area from NASA's moderate-resolution imaging spectroradiometer (MODIS). From the MODIS reflectance data in seven wavelengths, we estima...
متن کاملSystem Transparency, or How I Learned to Worry about Meaning and Love Provenance!
Web-based science analysis and processing tools allow users to access, analyze, and generate visualizations for vast amounts of data without requiring the user to directly manage the data or the data analysis processes or understand the limits on the underlying data. These tools have the potential to provide a significant productivity increase to science users of all levels of experience as wel...
متن کامل